Maintaining Unstructured Case Bases
نویسندگان
چکیده
With the dramatic proliferation of case based reasoning sys tems in commercial applications many case bases are now becoming legacy systems They represent a signi cant portion of an organization s assets but they are large and di cult to maintain One of the contribut ing factors is that these case bases are often large and yet unstructured they are represented in natural language text Adding to the complexity is the fact that the case bases are often authored and updated by di er ent people from a variety of knowledge sources making it highly likely for a case base to contain redundant and inconsistent knowledge In this paper we present methods and a system for maintaining large and unstructured case bases We focus on two di cult problems in case base maintenance redundancy and inconsistency detection These two problems are particularly pervasive when one deals with an unstructured case base We will discuss both algorithms and a system for solving these problems As the ability to contain the knowledge acquisition problem is of paramount importance our methods allow one to express relevant domain expertise for detecting both redundancy and inconsistency nat urally and e ortlessly Empirical evaluations of the system prove the e ectiveness of the methods in several large domains
منابع مشابه
Redundancy Detection in Semistructured Case Bases
ÐWith the dramatic proliferation of case-based reasoning systems in commercial applications, many case bases are now becoming legacy systems. They represent a significant portion of an organization's assets, but they are large and difficult to maintain. One of the contributing factors is that these case bases are often large and yet unstructured or semistructured; they are represented in natura...
متن کاملRedundancy and Inconsistency Detection in Large and Semi-structured Case Bases
With the dramatic proliferation of case based reasoning systems in commercial applications, many case bases are now becoming legacy systems. They represent a significant portion of an organization’s assets, but they are large and difficult to maintain. One of the contributing factors is that these case bases are often large and yet unstructured or semi-structured; they are represented in natura...
متن کاملCASE-QA: Context and Syntax embeddings for Question Answering On Stack Overflow
Question answering (QA) systems rely on both knowledge bases and unstructured text corpora. Domain-specific QA presents a unique challenge, since relevant knowledge bases are often lacking and unstructured text is difficult to query and parse. This project focuses on the QUASAR-S dataset (Dhingra et al., 2017) constructed from the community QA site Stack Overflow. QUASAR-S consists of Cloze-sty...
متن کاملTowards the use of case properties for maintaining case based reasoning systems
Because of the importance of maintenance in the realm of case–based reasoning systems, methods of maintaining case bases using case properties will be presented. The necessary notation is given, along with definitions of the properties themselves, which are correctness, consistency, incoherence, minimality, and uniqueness. Use of these properties in five experiments is explained, and the result...
متن کاملFLORA – Publishing Unstructured Financial Information in the Linked Open Data Cloud
In the world, where computers assist humans in information processing in almost every aspects of our lives, there are still huge gaps of unsurveyed areas, where data exists in an unstructured or unprocessable form limiting its usefulness and requiring extra human effort. Many times such data is extremely useful for many parties, as is the case of financial data. This paper describes an ongoing ...
متن کامل